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Abstract —It is known that the coupling lenuna |1| pro¬ 
vides a useful tool in the study of prohablllty theory and 
its related areas; it describes the relation between the 
variational distance of two probability distributions and the 
probability that outcomes from the two random experiments 
associated with each distribution are not identical. In this 
paper, the failure probability Interpretation problem that 
has been presented by Yuen and Hirota is discussed from the 
viewpoint of the application of the coupling lemma. First, 
we introduce the coupling lemma, and investigate properties 
of it. Next, it is shown that the claims for this problem in 
the literatures (TOl, Hll are justified by using the coupling 
lemma. Consequently, we see that the failure probability 
interpretation is not adequate in the security analysis of 
quantum key distribution. 

I. Introduction 

In the theoretical studies on quantum key distribution 
(QKD) done in the last decade, the trace distance criterion 
is widely used for evaluating the QKD system for one¬ 
time pad {e.g. m, 0, a, Q, 0, Q, 0, 0). In the 
trace distance criterion, the security notion “e-secure” is 
dehned by 

d=]^\\PKE-h® PE\\<e, (1) 

where corresponds to the real system of the key 

and environment, and pu (8) pe the ideal one. Further, it 
is claimed that the parameter e is interpreted as the so- 
called (maximal) failure probability of the QKD protocol. 
For this failure probability interpretation of the parameter 
£, Yuen and Hirota have respectively presented their 
objection, together with various other theoretical lack in 
the security analysis of QKD {e.g. ifTOll . ifTTll . lIT^ . lfT3l . 
ina, csi, csi, htd. In this paper, we treat this failure 
probability interpretation problem. 

Tracking back through the development history of the 
trace distance criterion to seek the origin of the failure 
probability interpretation, one can arrive at Lemma 1 
of the literature 0: “'Let P and Q be two probability 
distributions. Then there exists a joint probability dis¬ 
tribution Pxx' such that Px = P, Px' — Q, ttnd 

^ x'\ = S{P,Qy\ where S{P,Q) is 
the variational distance between P and Q. Based on this 
lemma, they give the failure probability interpretation 
to £. However, Yuen has repeatedly claimed that a “for 
every” statement would be needed rather than the “there 


exists” statement if it justihes the interpretation, and 
basically, there is no cause one must choose such favor¬ 
able distribution other than the independent distribution 
Pxx' = P Q (nni, im and its subsequent papers). 
But, despite the series of criticisms, the failure probability 
interpretation is still kept 0. This situation motivates 
us to discuss on the failure probability interpretation 
problem. 

To discuss the failure probability interpretation prob¬ 
lem from a new point of view, we employ a lemma called 
the coupling lemma, which was given by Aldous m. 
This lemma describes the relation between the variational 
distance of two probability distributions and the proba¬ 
bility that outcomes from the two random experiments 
associated with each distribution are not identical. As we 
will show later, the use of the coupling lemma would 
provide a clear perspective on this problem. 


H. Coupling Lemma 

We first introduce the coupling lemma, together with 
the dehnition of the variational distance of two probabil¬ 
ity distributions. The original statement of the coupling 
lemma is found in the literature IT] (Lemma 3.6, p.249). 


Definition 1 (variational distance): Let X and Y be 
random variables that take on values from a hnite al¬ 
phabet A = {ai, 02,..., otv}- Let Px and Py denote 
probability distributions of X and Y, respectively. The 
variational distance between Px and Py is defined by 


v{Px,Py) 


max 


Px{S) 


Py{S) 


or equivalently, by 

v{Px,Py) = i ^|Px(a) -PY(a) 

a^A 


( 2 ) 

(3) 


Definition 2 (coupling): There are two probability dis¬ 
tributions Px and Py that are dehned on a hnite set 
A. Consider a joint probability distribution Pxy on A^ 
whose marginal distributions are Px and Py. We call 
this Pxy a coupling of Px and Py. ■ 





Theorem 3 (coupling lemma JTl): Suppose that Px and 
Py are given. 

(a) For every coupling Pxy of Px and Py, 

v{Px,Py) <Vr{x y}. (4) 

(b) There exists a coupling Pxy such that 

v{Px.Py) = ^r{x^y}. ( 5 ) 

□ 

Proof: One would be able to find the coupling lemma 
and its proof in the textbooks on probability theory and 
its applications {e.g. Km, mi, iioi). The following proof 
is obtained by modifying the proof of Theorem A.6 of 
the literature i). 

(a) Let Pxy be an arbitrarily chosen coupling of Px 
and Py. It is clear that 

Pxy {a, a) < Px{a) and Pxy {a, a) < Py{a) 

for every a G A, because of the relationship between a 
joint probability distribution and the associated marginal 
distributions. So, we have 

Pxy{a,a) <mm{Px{a),Py{a)} \/a G A. (6) 

Note that the converse of the inequality above, 
PxY{a,o.) > min{Px(o),Pyja)}, is never established. 
Summing up the both sides of Eq.® with respect to a, 
we have 

PT{x = y} = ^ Pxy {a, a) 

aGA 

< min{Px{a),Py{a)}. (7) 

aGA 

This inequality is arranged to the following form. 

Pr{x f y} = 1 — Prja; = y} 

> 1 - mm{Px{a),Py{a)} 

aGA 

= y] [Px (a) - minlPjf (a), Py (a)} . 

aGA 

( 8 ) 


Here let us define the following partition of A: 

B = {h:Pxib)>Py{b)}, 

B = {b:Px{b) <Py{b)}. 


Then 

Eq.® 


J2[Pxib)-min{Px{b),Py{b)} 

bGB 

+ y^[Px(&) - min{Px(&),Py(&)} 

bGB 


Y\Px{b)-Py{b) 

bGB 

Px{B)-Py{B) 

max Px(S) — PyiS) 
5C.4L . 


v{Px,Py)- 


(9) 


This completes the proof for (a). 

(b) In the preceding part, we observed that the inequal¬ 
ity I® follows from Eq.®. This implies that the equality 
in Eq.® is established when the equality in Eq.® holds. 
The question is whether there exists a coupling Pxy such 
that 


Pxy(a, a) = min{Px(a), Py(a)} Va G A. (10) 


The existence of such a coupling can be shown by the 
following constructive method. 

step 1. Define Pxy{a,a) by Ea.ifTOll. It is clear that 
Pxy {a, a) > 0 for every a G A. 

step 2. If v{Px, Py) = 0, it means that the two random 
variables, X and Y, are equal: X = Y. In this case, we 
let PxY{a,b) = 0 for (a,b) S A^ such that a f b. 
Clearly, Prja; ^ y} = 0 = v{Px,Py), and 


y^ PxY{a,y) = Px{a) = Py(a) = PxY{x,a) 

VGA xGA 


for every a G A. 

Next let us consider the case for v{Px,Py) ^ 0. In 
this case, the two random variables, X and Y, are not 
equal: X fY. This implies Prjx y} > 0. Then we 
define 


Pxy {a, b) 


Rx{a)RY{b) 
Pr{x y} 


V(a, b) G A'^ s.t. afb, 

( 11 ) 


where the functions in the numerator are given by 


Rx{a) = Px{a) — PxY{a,a) '\fa G A, (12) 
RY{b) = Py{b)-PxY{b,b) V&eA (13) 

Since Rx{a) > 0 and RY{b) > 0, we see that 
PxY{cL,b) > 0 for every (a, 6) G A?. In addition, we 
observe that 


Px (a) = X! Py((>) = Pr{x 7 ^ y} (14) 


and 


Px(a)Py(a) = Px(&)Py(&) = 0 Va, & e A (15) 


By using Eqs. (Ell and dEll, we can verify that the 
constructed joint probability distribution has the given 
distributions Px and Py as the associated marginal 











distributions. For any fixed a G A 
'^PxY{a,b) 


b€A 


= X! PxY{a,b) + ^ PxY{a,b) 

b:b^a 


b-.b—a 


Rx{a)RY{b) , n ^ ^ 

= _z „.i -^ PxY[a, a) 


b\b^a 

Rx{a 


Prja: ^ y} 


Pr{x ^ y} 


^ i?y( 6 ) j +PxYia,a) 


ib:b^a 


Pyi^) - Rvia)^ + PxY{a, a) 


Pr{ 

Rxja) 
Pr{x ^ y) 


(Pr{x ^y} - Rxia)) + PxY^a, a) 


„ , , Rx{a)RY{a) „ , . 

= Rxia) - p r / . +Pyy(a,a) 

Prja; y\ 


Taking this fact into account, we have 

1 - ^ min{Px(a),Py(a)} < 1 - X! Px{a)PY{a), 
a^A a^A 

when at least one a' G A satisfies the conditions 0 < 
Px{a!) < 1 and 0 < Rxia') < 1. With the help of the 
calculation of Eq.®, the above inequality is summarized 
to the following statement; 

Corollary to Theorem 3: Suppose that X and Y are 
independent, and at least one a' G A satisfies the 
conditions 0 < Px{a!) < 1 and 0 < PY{a') < 1. Then, 

v{Px,PY)<Pr{x^y}. (17) 


The next example was used in the literatures cni, 
im to explain the incorrectness of the failure probability 
interpretation. 


= Rx (a) + PxY (a, a) 

= Px(a). 

With the same manner, we also have 

Y,PxY{a,b)=PY{b) 'ibGA. 

a^A 

Thus, it was shown that the joint probability distribution 
PxY constructed from Eas. lfTOl l and (fTTI) has the marginal 
distributions Px and Py. Since Ea. ifTOl l holds, we have 
v{Px,Py) = Pr{x ^ y} with this constructed distribu¬ 
tion PxY- ® 

A concrete example of couplings of Px and Py is 
shown in the appendix A. This illustrates the coupling 
lemma (Theorem 3) in a case of X ^ Y. 

It should be emphasized that the coupling lemma 
consists of two parts: (a) “for every” part and (b) “there 
exists” part. Lemma 1 of the literature m, Proposition 
2.1.1 of the literature a, and Theorem A.6 of the 
literature 0, these are essentially identical to the “there 
exists” part of the coupling lemma. On the other hand, 
an example of independent joint probability distribution, 
which was treated in the literatures Eoi, mi, can be 
explained by the “for every” part of the coupling lemma. 
Eor the later discussion, we treat the independent joint 
distribution case here again. 

Suppose that X and Y are independent. Then the joint 
probability distribution Pxy is given by 

PxxiaA) = Px{a)Py{b) 'i{a,b)cA^. 

Since 0 < Px{a) < 1 and 0 < Py(b) < 1, we have 

Px{a)Py{a) < inm{Px{a),Py{a)} Va G A. 

If some a' G A satisfies the conditions 0 < Px{a') < 1 
and 0 < Py{a') < 1, then it is reduced to 

Px{a')Py{a') < min{Px(a'),-Pv(a')}- (16) 


Example 4 ( ifTOl . ifTTI '): Let A = {1,2,...,W}, and 
Px{x) = f/N f/x G A and Py{y) = f/N My G A. Sup¬ 
pose that X and Y are independent. Then Pxy{x,y) = 
Px{x)Py{y) = l/N"^ for every (a;, y) G A^. In this case, 
we have 

v{Px,Py) = 0 < 1 - ^ = Prja: ^ y} 

for N >2. ■ 

Up to this point, we have considered the case of one- 
dim random variables. It is easy to extend the case of 

v{Px,Py) to that of u(PxiJf 2 ,-FViYs)- 

Theorem 5: Let {Xi,X 2 ) and (Ti,T2) are two-dim 
random variables that take on values from the same finite 
alphabet where A = joi, 02, ..., ox}. Suppose that 
PxiX 2 and Py^y^ are given. 

(a) For every coupling Px^x^y^y^ of PxtX^ and PY^Y 2 , 

v{PxrX2, PyiY2) < Pr{(xi,X2) ^ iyi,y2)} ( 18 ) 
where 

v{PxiX2,PyiY2) 

= 2 ^ (g, b) - Py^y^ (g, b) . 

aGAb£A 

(b) There exists a coupling PX 1 X 2 Y 1 Y 2 such that 

v{Px^X 2 , PyiY 2 ) = Pr{(a;i,a: 2 ) ^ ( 2 / 1 , J/ 2 )}- (19) 

□ 

Proof: This is due to the inequality 

PxiX2YiY2{a,b,a,b) 

< uim{Px^X2{a,b),PYiY2ia,b)} ( 20 ) 

for every (a, b) G A^. ■ 








A concrete example of couplings of Px^x^ PyiY 2 
is shown in the appendix B. This illustrates the coupling 
lemma for two-dim random variables (Theorem 5) in a 
case of (Xi,X 2 )^(ri,y 2 ). 

The reader might recall a description on the vari¬ 
ational distance for two-dim random variables {X,X) 
and (X, Y) in the textbook 11211 of Nielsen and Chuang 
(p.402). From the point of view of the coupling lemma, it 
can be understood as a special case that has the condition 
Xi = X 2 = Fi. 

Corollary to Theorem 5: When the condition Xi = X 2 = 
Yi is imposed, then 


Eq.dllll, 

PYiY2{a,b) = PxiX2YiY2ia,a,a,b) 

> 0 

= PxiX2{a,b) 

= PxiX2YiY2ia,b,a,b). (26) 

This yields 

PxiX 2 YiY 2 {a, b, a, b) 

= PxiX2{a,b) 

= inm{PxiX2{a,b),PY^Y2{a,b)}. (27) 

Summarizing Eas. (l25]) and (l27l) . 


viPxiX2, PY1Y2) = Pr{a:2 2/2}- 


( 21 ) 


□ 

Proof. Erom the condition Xi = X 2 — Fi, we have 
PxiX 2 YiY 2 {xi,X 2 ,yi,b) = 0 if a ;2 ^ Xi or yi ^ xi. 
Hence, for every a,b ^ A, 


PxiY2{a, b) 

Px2Y2{a, b) 

PYiY2{a, b) 

If a = 6 , then 
PxiX2{a,b) 


E E PxiX2YiY2ia,X2,yi,b) 

x 2 €A yi€A 

PxiX2YiY2(a,a,a,b)', (22) 

E E PxiX2YiY2(xi,a,yi,b) 

xiGA yi€A 

PxiX2YiY2(a,a,a,b)', (23) 

E E PxiX2YiY2(xi,X2,a, b) 

X\^A X2^A 

PxiX2YiY2ia,a,a,b). (24) 


PxiX2{a,a) 

= E E PxiX 2 YiY 2 {a, a, 2 / 1 , 2 / 2 ) 

yi&A y 2 &A 

= E PxiX2YiY2ia,a,a,y2) 

y2&A 

> -PxiXsFiYsCa,®, 0,2/2) '^y'2 ^ A. 

This implies 

PxxX 2 {o.,h) > PxiX 2 YiY 2 { 0 ', a, a,b) if a = 6 . 
Substituting Ea. (l24l i into this, we have 

-PxiX2(o,o) > PxiX2YiY2{a,a,a,a) = PY^Y2{a,a). 

Therefore, 


PxiX2YiY2{a,a,a,a) 

= PYiY2{a,a) 

= mm{PxiX2{a,a),PYiY2{a,a)}- (25) 

Next we assume that a 7 ^ 6 . It is obvious that 
PxiX 2 YiY 2 {a,b,a,b) = 0 = PxiXalo, 6 )- By using 


PxiX2YiY2{a,b,a,b) 

= mm{Px^X2{a,b),PY^Y2ia,b)} (28) 

for every (a, b) G A^. Erom the comparison between this 
and Ea. (l 2 ^ . we hnd that 

v{PxiX2,PyiY2) =Pr{(xi,X2) ^ (2/1,^2)}- 

Moreover, since Xi = X 2 = Fi, the probability 
Pr{(xi,X 2 ) 7 ^ ( 2 / 1 , 2 / 2 )} is reduced to Pr{x 2 7 ^ 2 / 2 }- 
Thus, we proved that Xi = X 2 = Yi yields the equality 
of v{PxiX 2 ,PyiY 2 ) and Pr{x 2 7 ^ 2 / 2 }. ■ 

Pinally, let us summarize the relationship between the 
variational distance and the probability of x y. 

1) The variational distance v{Px,Py) does not 
always mean the probability Pr{x f y}. 
v{PxiX 2 J PY 1 Y 2 ) too. Basically, the variational dis¬ 
tance v{Px, Py) is a lower bound of Pr{x y}. 

2) Suppose that the two random variables X and F 
are independent, and some a' G A satishes the 
conditions 0 < Px{a') < 1 and 0 < Py^A) < 1. 
In this case, we have v{Px, Py) < Pr{x y}. 

3) Conversely, a maximal coupling of Px and Py 
that yields v{Px,Py) = Pr{x 7 ^ y} demands 
correlation between the outcomes of X and F. 

III. On The Pailure Probability Interpretation 
Eor e 

Let us return to the main course of this paper. In 
the security analysis of QKD under the trace distance 
criterion, as mentioned in the section I, “^-secure” is 
defined by 

d= ^WPke - Pu Pe\\ <£■ 

The problem that Yuen and Hirota have discussed and 
we discuss here is whether or not the parameter e has a 
meaning of “the failure probability”. 

In the literatures m, 11, 0, 0, 0, 0, 0, 0, 

it is claimed that the operational meaning of e is the 
(maximal) failure probability of the QKD protocol. To 
give such an interpretation, they first make the inequality 

v{Pk,Pu) < d < e, 


(29) 


where Pr- is a probability distribution of the real key and 
Pij is that of the ideal key (uniform key). After that, they 
apply the statement (b) of the coupling lemma to give the 
failure probability interpretation. If we dare to ignore the 
physical restrictions and focus only on the mathematical 
possibilities of the coupling lemma, a coupling Pku of 
Pk and Pjj that yields v{Pk,Pu) — Pr{/c ^ m} exists. 
However, we cannot drop physical restrictions. 

The distribution Pk describes the probabilistic behav¬ 
ior of measurement outcomes that are obtained through 
an appropriate POVM for the system of Pke- addition, 
the distribution Pjj is obtained by the same POVM. 
If one supposes that the measurement outcomes of the 
real system and the ideal system are characterized by a 
maximal coupling, it means that measurement outcomes 
from the two distinct systems are correlated. In other 
words, it demands that measurement outcomes in the real 
system depends on that in the ideal system. There is no 
clear reason why one must choose such a coupling and 
why correlation between the real keys and the ideal keys 
is allowed. 

If the equality of the variational distance v{Pk,Pu) 
and the probability Pr{/c ^ m} were established for 
every coupling of Pk and Pu, the failure probability 
interpretation might be justified. But, we have already 
seen that the equality does not always hold. 

Further, we assume here that the measurement out¬ 
comes from the real and ideal systems are independent. It 
would be a natural situation, and is physically acceptable. 
As shown by Shannon Il22ll (p.681), all keys have to be 
equally likely to make one-time pad secure. This is re¬ 
flected to the settings of the ideal probability distribution 
Pu. Indeed, Pu is given to be a uniform distribution. So, 
Pu satisfies the condition 0 < Pu{u) < 1. On the other 
hand, if some sequence k' generated by the QKD protocol 
possesses probability PK{k') = 0 or PK{k') — 1, it is 
clear that the generated sequence does not work as the 
secret key for the embedded one-time pad. Therefore, we 
can assume that every sequence k satisfies the condition 
0 < PK{k) < 1 without loss of generality. But, as shown 
in Corollary to Theorem 3, the strict inequality 

v{PK.Pu)<^^{k^u} (30) 

is established in this case. The juxtaposition of Eas.(l29b 
and dSOl l tells nothing about the relationship between e 
and Pr{fc ^ u). 

Consequently, we have the following facts: 

1) If a maximal coupling of Pk and Pu is employed, 
correlation between the real and ideal keys is 
needed. But, there is no cause one must choose 
a maximal coupling. 

2 ) v{Pk,Pu) < Pr{A: 7 ^ u} holds for every coupling 
of Pk and Pu- That is, v{Pk,Pu) is a lower bound 
of all the possible probabilities Pr{A: ^ rt}. 

3) If the real and ideal keys are statistically in¬ 
dependent, v{Pk,Pu) < Pr{A: 7 ^ u). Clearly, 


v{Pk,Pu) does not mean Pr{A: 7 ^ rt}. 

Let us recall Example 4. In the viewpoint of the appli¬ 
cation of the coupling lemma, we can understand that this 
example illustrates all the facts listed above compactly; 
it immediately shows the fact 3), and by considering the 
“not independent” cases, one would arrive at the facts 
1) and 2). Thus, the use of the coupling lemma justihes 
the Yuen’s claim for the failure probability interpretation 
problem that are repeatedly stated in the literatures IITOll . 
O and its subsequent papers. In summary, we can say 
that £ does not mean probability at all, in particular, it 
does not mean “the failure probability”. 

IV. Conclusions 

This paper is concerned with the failure probability 
interpretation problem that has been presented by Yuen 
and Hirota. To discuss the problem, we used the coupling 
lemma. First, we observed that the variational distance 
does not always mean probability due to the coupling 
lemma. In particular, the analysis of the independent dis¬ 
tribution case showed that the variational distance of such 
a case is not a probability. Applying the coupling lemma 
to the discussions on the failure probability interpretation 
problem, we could justify the Yuen’s claim stated in 
the literatures m, mi and its subsequent papers. As 
a result, we can conclude that the failure probability 
interpretation is not adequate in the security analysis of 
quantum key distribution. 
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Appendix 


A. Coupling of Px and Py 

In this section we treat a case of X 7 ^ V to illustrate the 
mathematical notion of the coupling of two probability 
distributions. 

Suppose that the distributions of X and Y are respec¬ 
tively given as 


Px = (0.10000,0.20000,0.30000,0.40000), 
Py = (0.25000,0.25000,0.25000,0.25000). 

Then we have v{Px, Py) = 0 . 20000 . 


1) Case a (X and Y are independent): If the random 
variables X and Y are independent, then the joint prob¬ 
ability distribution is given as follows. 


/ 0.02500 0.02500 0.02500 
0.05000 0.05000 0.05000 
0.07500 0.07500 0.07500 
\ 0.10000 0.10000 0.10000 


0.02500 \ 

0.05000 

0.07500 

0.10000 j 


With this joint probability distribution, we have 

v{Px,Py) = 0.20000 < 0.75000 = Pr{a; y}. 

This is also an example for Corollary to Theorem 3. Thus 
the variational distance v{Px,Py) behaves as a lower 
bound of the probability Prja; 7 ^ y} when X and Y are 
independent. 


2) Case b (maximal coupling): Using the construction 
procedure shown in Section II, a maximal coupling of Px 
and Py is given by 


p(^) _ 

^XY — 


/ 0.10000 
0.00000 
0.03750 
V 0.11250 


0.00000 

0.20000 

0.01250 

0.03750 


0.00000 

0.00000 

0.25000 

0.00000 


0.00000 \ 
0.00000 
0.00000 
0.25000 ) 


In this case, we have 

v(Px,Py) = 0.20000 = Pr{a; 7 ^ y}, 


which is an example of Theorem 5(b). Observe that this 
coupling demands a strong correlation between X and 
Y. Indeed the following events never happen if X and Y 
obey this coupling; 

event (X = 1) A (U = 2), 
event (AT = 1) A (U = 3), 
event (AT = 1) A (U = 4), 
event (AT = 2) A (U = 1), 
event (AT = 2) A (U = 3), 
event (AT = 2) A (U = 4), 
event (Ai = 3) A (U = 4), and 
event (X = 4) A (Y = 4). 

Thus the half of the possible events never occur. 


3) Case c (not independent, not maximal): When two 
probability distributions are given, there are infinitely 
many couplings in general. Theorem 5(a) covers all of 
the possible couplings, so that no coupling breaks the 
inequality Q. To see this, let us consider the following 
joint probability distribution of Px and Py. 


/ 0.06250 0.01250 0.01250 
0.02500 0.12500 0.02500 
0.05625 0.04375 0.16250 
\ 0.10625 0.06875 0.05000 


0.01250 \ 
0.02500 
0.03750 
0.17500 


This is neither the case of independent random variables 
nor the case of maximal couplings. Even in this case, of 
course, the inequality holds. 


v{Px,Py) = 0.20000 < 0.47500 = Pr{a; 7^ y}. 


This is a typical behavior of the variational distance. 








B. Coupling of PxiX 2 PyxY 2 

This section gives a concrete example of couplings for 
two two-dim probability distributions. 

Suppose that the distributions of {Xi , X2) and (Ti, Y2) 
are respectively given as 

/ 1/3 0 0 \ 

Px,X 2 = 0 1/3 0 , ( 31 ) 

\ 0 01 / 3 / 

and 

/ 1/9 2/9 0 \ 

Py^Y. = 1/9 1/9 1/9 . ( 32 ) 

\ 0 1/9 2/9 ) 

For these distributions, v{PxiX2: PY1Y2) = 5 / 9 . 


1) Case a (maximal coupling): The joint probability 
PX1X2Y1Y2 table I (of the next page) is constructed 
according to the procedure described in Section II. This 
provides a maximal coupling of PxiX2 ^nd PyiY2- In this 
case we have 

5 

v(Px,X2,PyiY2) = g = Pr{(a;i,X2) ^ (2/1,2/2)}- 


2) Case b (Xi = X 2 = Ti is imposed): The joint 
distribution PxIx2YiY2 't* TABLE II (of the next page) 
is designed for satisfying the condition Xi = X2 = Yi. 
Eor this joint distribution, we have 

5 

v{Px^X2,Py^Y2) = g = Pt{(xi,X 2) -h (2/1,2/2)}- 
Observe that 

5 

Pr{(xi,a;2) (2/1,2/2)} = Pr{a;2 2/2} = g- 

Therefore we have 

5 

v{PxiX2,PyiY2) = g = Pr{a;2 2/2}- 

This can also be understood as an example for the case 
of X = X in the textbook ED of Nielsen and Chuang. 


3) Case c ((Xi,X 2 ) and (yi,y 2 ) ore independent): 
Let us consider the case that (Xi,X2) and (yi,F2) are 
independent. The joint distribution PxIx2YiY2 1 °^ 
case is shown in TABLE III (of the next page). As 
expected from Corollary to Theorem 3, we have the 
following strict inequality. 

5 23 

v{Px,X2,Py,Y2) = 9 < ^ =Pr{(xi,a;2) ^ (2/1,^2)}- 

Note that this example also implies that if the condition 
Xi = X2 is only imposed, that is, the condition Xi = Yi 
is removed from the condition Xi = X2 = Yi, then the 
equality of v{PxiX2^ PY1Y2) and Pr{x2 2/2} is not 
guaranteed. 


TABLE I 

Maximal coupling i/ii 2 / 2 ) constructed by the procedure described in Section II. 



^1 = 1 


PT^3 

Y2 = 1 

II 

y 2 = 3 

y2 = 1 

II 

y2 = 3 

y2 = 1 

II 

to 

y 2 = 3 

Xi = I 

X 2 = 1 

1/9 

4/45 

0 

2/45 

0 

2/45 

0 

2/45 

0 

A2 = 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Xi=2 

X2 = 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

0 

4/45 

0 

2/45 

179 

2/45 

0 

2/45 

0 

X 2 = 3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Xi=2 

X2 = 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X 2 = 3 

0 

2/45 

0 

1/45 

0 

1/45 

0 

1/45 

279 ~ 


TABLE II 

Coupling 012,1/1,2/2) under the condition Xi = X2 = Vi 



^1 = 1 

PT ^2 

PT ^3 

y2 = 1 

II 

CO 

II 

y2 = 1 

II 

7 ) 

y 2 = 3 

y2 = 1 

y2 = 2 

y2 = 3 

Xi = I 

X2 = 1 

1/9 

279 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Xi = 2 

X2 = 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

0 

0 

0 

1/9 

179 

179 

0 

0 

0 

X2 = 3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Xi = 2 

X2 = 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 3 

0 

0 

0 

0 

0 

0 

0 

1/9 

279 


TABLE III 

Coupling Pxlx 2 YiY 2^^''-’^'^’ y '^’ y '^'^ under the conditions (Xi, X2) and (Ti, Ti) are independent. 



Vi = 1 

yi = 2 

Ti=3 

y2 = 1 

y2 = 2 

y2 = 3 

y2 = 1 

y2 = 2 

y2 = 3 

y2 = 1 

y2 = 2 

y2 = 3 

n 

X2 = 1 

1/27 

2/27 

0 

1/27 

1/27 

1/27 

0 

1/27 

2/27 

X2 = 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Xi = 2 

X2 = 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

1/27 

2/27 

0 

1/27 

1/27 

1/27 

0 

1/27 

2/27 

X2 = 3 

0 

0 

0 

0 

0 

0 

0 

0 

0 

Xi = 2 

X2 = 1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 2 

0 

0 

0 

0 

0 

0 

0 

0 

0 

X2 = 3 

1/27 

2/27 

0 

1/27 

1/27 

1/27 

0 

1/27 

2/27 





































































































































































































































































































